Knowledge-Based Information Extraction: A Case Study of Recognizing Emails of Nigerian Frauds

نویسندگان

  • Yanbin Gao
  • Gang Zhao
چکیده

This paper describes the methodology, process and results of developing an application ontology as software specification of the semantics of forensics in the email suspicious of Nigerian frauds. Real life examples of fraud emails are analyzed for evidence and red flags to capture the underlying domain semantics with an application ontology of frauds. A model of the natural language structure in regular expressions is developed in the light of the ontology and applied to emails to extract linguistic evidences of frauds. The evaluation of the initial results shows a satisfactory recognition as an automatic fraud alert system. It also demonstrates a methodological significance: the methodical conceptual modeling and specific purpose-driven linguistic modeling are effective in encapsulating and managing their respective needs, perspectives and variability in real life linguistic processing applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Solvent Extraction of Zinc with Triphenylphosphite (TPP) from a Nigerian Sphalerite in Hydrochloric

A hydrmetallurgical study based on the extraction of Zn and Pb from a Nigerian sphalerite mineral leached with 4 M hydrochloric acid has been undertaken. Triphenylphosphite (TPP) has proved to be very effective for the extraction of zinc. With 2 M TPP and at ambient temperature of 25±2°C; 35.71% and 66.7% of Pb and Zn were extracted, respectively, into the organic phase within 60 min. The recov...

متن کامل

Assessing the information literacy of Farhangian University student-teachers from six perspectives: reviewing, recognizing sources, disseminating, recognizing, flexibility and seeking information

Background: Today, the scope of science and knowledge and, consequently, its measurement has become very common. The use of this information in the university environment depends on the students' knowledge of the places where the information is disseminated and how it is used and the methods of retrieving and using that information. The aim of this study was to investigate the information liter...

متن کامل

Investigating Non-Native English Speaking Graduate Students’ Pragmatic Development in Requestive Emails

The present study investigated learners’ interlanguage pragmatic development through analysis of 99 requestive emails addressed to a faculty member over a period of up to two years. Most previous studies mainly investigated how non-native English speaking students’ (NNESs) pragmalinguistic and sociopragmatic competence differed from native English speaking students (NESs) and compared learners ...

متن کامل

Using the Wisdom of Crowds to Prevent Internet Frauds

With the rapid growth of the netizen population in China, more and more internet frauds are committed. Many people suffer from internet frauds by losing wealth or other valuable things. To prevent internet frauds, we first need to discover the methods in which internet frauds are conducted. In this paper, we investigate and categorize the internet frauds in China. So far, there are typically si...

متن کامل

Digital Waste Sorting: A Goal-Based, Self-Learning Approach to Label Spam Email Campaigns

Fast analysis of correlated spam emails may be vital in the effort of finding and prosecuting spammers performing cybercrimes such as phishing and online frauds. This paper presents a self-learning framework to automatically divide and classify large amounts of spam emails in correlated labeled groups. Building on large datasets daily collected through honeypots, the emails are firstly divided ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005